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ABSTRACT 

An overview of preliminary work into the development of 
a new pedagogical approach for sound recording and 
music production, inspired by developments in the field of 
Music Information Retrieval (MIR) and Intelligent Music 
Production (IMP), using multitrack audio archives, 
objective audio features, and new visualization tools. 

1. INTRODUCTION 

Music production inherently involves many complex 
technical and artistic decisions made by the sound 
engineer. These decisions are the result of extensive 
training and years of practice to improve the acuity of 
technical skills and critical listening. 

It has been shown that ear training methods are effective 
for improving identification, recognition and reproduction 
of frequencies (equalization) [1-3], dynamics processing 
[3] and artificial reverberation [3-5]. Appropriately, these 
training programs deal with the skills in isolation and in 
the majority of cases, for single sources. More often than 
not, this is not the situation facing a sound engineer in 
practice. The development of higher level, more complex 
and integrated listening skills (multitrack recording, 
multitrack mixing) is most often taught using critique or 
apprenticeship methods 1 ; the student either observes a 
mentor or produces work for critique and evaluation, 
repeating this until they become proficient at the work. 

The fields of MIR and IMP utilize many different types of 
audio signal features [6], for a variety of different tasks. 
The audio being analyzed can be either the composite 
master mix, or the multitrack source material. The goal 
may be to create an automatic mixing system or to classify 
musical genres. In both cases, the original audio 
engineer’s technical and artistic decisions - the result of 
the training and experience of that engineer - are in some 
part represented in the measured audio features necessary 
for these tasks. The focus of this work is to explore the 
extent to which this experience and training can be 
objectively measured, visualized and then used to inform 
the training of audio engineering and music production 
students. 


1 The decline of the large studio and in some cases, geographic location, 
can severely limit internship/apprentice opportunities for contemporary 
audio engineering and music production students [8] and is another 
contributing influence on this work. 
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2. METHODOLOGY 

The audio used for analysis is taken from a comprehensive 
collection of recorded materials created by the Canadian 
art-rock band, KLO (1979-86). The collection contains two 
album recordings, one done at Skyline Studios in New 
York, and the other at CD Records in San Francisco. The 
recordings include multiple takes of individual songs, and 
versions of the same song, which where recorded at both 
studios. Comparative analysis is possible using the 
progression of takes for a given song and between 
production personal. Analogue masters tapes were 
transferred and digitally archived to .wav format (96kHz, 
24bit). 

2.1. Tempo Analysis 

Tempo was chosen to be the test case for a music 
production element - the ability of an engineer or producer 
to identify positive vs. negative tempo variation in a 
musical performance. Tempo extraction was done 
manually per bar for songs where multiple takes existed. 
This was then plotted with tempo vs. measure number, as 
shown in Figure 1. 



Figure 1. Tempo Map of Lulu, take 1 vs. 2. 


A histogram was generated from this same data and is 
shown in Figure 2. 



Figure 2. Histogram of Lulu, take 1 vs. 2. 
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2.2. Feature Analysis 
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The Max descriptors- object was used for all audio 
analysis; the list of audio signal features used is shown in 
Table 1. In this preliminary work, kick and snare sounds 
were analyzed for one song. 


Table 1. Audio signal features used in analysis. 


Feature 

Label 

Ref. 

Spectral Crest 

spectralcrest 

[6] 

Spectral Centroid 

logcentroid 

[6] 

Spectral Spread 

log_ s pread 

[6] 

Spectral Kurtosis 

logkurtosis 

[6] 

Spectral Brightness 

log_brightness 


Loudness 

loudness 

f61 


2.3. Preliminary Testing 

Using this analysis, a test session was run over the course 
of three days with six audio practicum students at the 
Banff Centre. Students were masters-level or post- 
graduate, early career professionals with International 
backgrounds (three) or Canadian nationals (three). 
Listening sessions, augmented with discussion about the 
measured audio signal features and visual aids, as shown 
in Figures 1 and 2, were used to inform a recording 
session where original drum tracks were re-recorded for 
one song from the archive. In this session, an emphasis 
was placed on drumming performance (timing) and the 
kick and snare drum recording above all other factors. 

3. RESULTS 

Table 2 shows the responses from a follow-up student 
experience survey. Students were asked to rank the 
following questions from 1-5 (l=very poor, 2=poor, 
3=adequate, 4=good, 5=excellent). 

1 . This teaching approach helped with your 
understanding of popular music production 

2. This teaching approach provided relevant skills 
and information for popular music production 

3 . This teaching approach offered and effective 
learning experience. 

4. The materials provided for learning were clear 

The results reported here are representative of written 
responses from the students to additional questions 
regarding the perceived strengths and challenges of this 
teaching approach. 


Table 2. Responses to student experience survey, number of 
respondents = 5. 


Question 

Mean 

Median 

StDev 

1 

4.25 

4.5 

0.96 

2 

4.5 

4.5 

0.58 

3 

4.25 

4 

0.5 

4 

4.5 

4.5 

0.58 
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Following the student session, analysis was performed on 
the resulting kick and snare drum recordings and 
compared to the original Skyline recording. Table three 
shows the raw measurements using the descriptors- 
object, with a third kick drum included for further 
comparison. 


Table 3. Audio feature analysis of three kick drum recordings. 


Feature 

Skyline 

Banff 

CDRecords 

spectral_crest 

28.50 

29.47 

23.40 

log_centroid 

875.98 

97.13 

4472.78 

log_sp re ad 

4.39 

6.83 

3 

log_kurtosis 

4.27 

3.64 

2.48 

log_brightness 

4.61 

0.71 

7.66 

loudness 

-71.52 

-66.92 

-70.46 


4. CONCLUSION 


Results from a student experience survey indicate that this 
pedagogical approach was both positive and informative to 
a variety of students, all having very different educational 
backgrounds and training. There is some evidence that 
students were able to use the discussion of audio feature 
analysis to focus their listening and make decisions 
regarding kick and snare drum recording, in order to 
emulate original recording quality. 

5. LUTURE WORK 

The automation of tempo analysis is necessary for the 
success of this pedagogical approach. A lager data set of 
exemplary kick and snare drum sounds will be undertaken 
in order to better understand and interpret the preliminary 
results presented here. Extending this approach to include 
other elements of music productions e.g. vocals, is the 
long-term goal of this work. 
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